feat: support compile torchair graph while warming up #839

NeverRaR · 2025-05-13T16:48:39Z

What this PR does / why we need it?

feat: support compile torchair graph while warming up

Does this PR introduce any user-facing change?

How was this patch tested?

vllm_ascend/worker/model_runner_v1.py

vllm_ascend/core/scheduler.py

vllm_ascend/worker/model_runner_v1.py

wangxiyuan · 2025-05-29T11:14:35Z

vllm_ascend/worker/model_runner_v1.py

+                torch._logging.set_logs(recompiles=True)
+            self.torchair_graph_batch_sizes = additional_config.get(
+                "torchair_graph_batch_sizes", [])
+            if not isinstance(self.torchair_graph_batch_sizes, list):


we have enable_graph_mode to controller torchair, here named torchair_graph_batch_sizes, w'd better to use the same prefix. How about use torchair_graph for all case? cc @zzzzwwjj

#947 (comment)

vllm_ascend/worker/model_runner_v1.py

wangxiyuan · 2025-05-30T03:37:47Z

I added a new ascendconfig to deal with all additional_config #1029 . The comment I added before can be addressed later. Please fix the merge conflict, then the PR is ready to go IMO.

wangxiyuan · 2025-05-30T08:29:37Z

test_scheduler.py should be updated as well. for example lora_config has been removed

Yikun · 2025-05-30T16:44:25Z

export DEVICE=/dev/davinci0
export IMAGE=m.daocloud.io/quay.io/ascend/vllm-ascend:main
docker run --rm \
--name xxx-test \
--device $DEVICE \
--device /dev/davinci_manager \
--device /dev/devmm_svm \
--device /dev/hisi_hdc \
-v /usr/local/dcmi:/usr/local/dcmi \
-v /usr/local/bin/npu-smi:/usr/local/bin/npu-smi \
-v /usr/local/Ascend/driver/lib64/:/usr/local/Ascend/driver/lib64/ \
-v /usr/local/Ascend/driver/version.info:/usr/local/Ascend/driver/version.info \
-v /etc/ascend_install.info:/etc/ascend_install.info \
-v /root/.cache:/root/.cache \
-it $IMAGE bash

# Fetch the latest main
cd /vllm-workspace/vllm-ascend
git pull --rebase

# add upstream
git remote add upstream https://github.com/vllm-project/vllm-ascend.git

# add git alias
cat ~/.gitconfig
[alias]
	pr = "!f() { git fetch -fu ${2:-$(git remote |grep ^upstream || echo origin)} refs/pull/$1/head:pr/$1 && git checkout pr/$1; }; f"

# checkout 839 pr
git pr 839

# Run test
export VLLM_USE_MODELSCOPE=true
pytest -sv tests/singlecard/test_scheduler.py

I noticed you are trying to fix scheduler UT via frequetly changes, this might the effective way to reproduce and run test locally

Signed-off-by: boying <897013703@qq.com>

### What this PR does / why we need it? feat: support compile torchair graph while warming up Signed-off-by: boying <897013703@qq.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>

### What this PR does / why we need it? feat: support compile torchair graph while warming up Signed-off-by: boying <897013703@qq.com>

### What this PR does / why we need it? feat: support compile torchair graph while warming up Signed-off-by: boying <897013703@qq.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>

### What this PR does / why we need it? feat: support compile torchair graph while warming up Signed-off-by: boying <897013703@qq.com>

### What this PR does / why we need it? feat: support compile torchair graph while warming up Signed-off-by: boying <897013703@qq.com> Signed-off-by: wangxiaoxin (A) <w00664509@china.huawei.com>

NeverRaR force-pushed the dev/graph branch 9 times, most recently from a49f965 to 99be815 Compare May 14, 2025 08:29

NeverRaR force-pushed the dev/graph branch 8 times, most recently from 71634df to 44b77b9 Compare May 29, 2025 06:17

ganyi1996ppo reviewed May 29, 2025

View reviewed changes

vllm_ascend/worker/model_runner_v1.py Show resolved Hide resolved

ganyi1996ppo reviewed May 29, 2025

View reviewed changes

vllm_ascend/worker/model_runner_v1.py Show resolved Hide resolved

ganyi1996ppo reviewed May 29, 2025

View reviewed changes

vllm_ascend/worker/model_runner_v1.py Show resolved Hide resolved

NINGBENZHE mentioned this pull request May 29, 2025

[bugfix] Add ep initialization check and change the return check to is_driver_worker #896

Merged

ganyi1996ppo approved these changes May 29, 2025

View reviewed changes

NeverRaR force-pushed the dev/graph branch 2 times, most recently from 5caa186 to 42544de Compare May 29, 2025 09:34

wangxiyuan reviewed May 29, 2025

View reviewed changes

wangxiyuan mentioned this pull request May 29, 2025

[perf]Support MOE Multi-stream in Deepseek #947

Merged

NeverRaR force-pushed the dev/graph branch from 42544de to b46b329 Compare May 30, 2025 06:01

jianzs approved these changes May 30, 2025

View reviewed changes

NeverRaR force-pushed the dev/graph branch from b46b329 to 413b91a Compare May 30, 2025 06:06

NeverRaR force-pushed the dev/graph branch 3 times, most recently from bafde8a to 09b0d9d Compare May 30, 2025 08:21

NeverRaR force-pushed the dev/graph branch from 09b0d9d to fc7ee5c Compare May 30, 2025 09:06

github-actions bot added the module:tests label May 30, 2025

NeverRaR force-pushed the dev/graph branch from fc7ee5c to 52e0e99 Compare May 30, 2025 10:03

wangxiyuan added the ready read for review label May 30, 2025

NeverRaR force-pushed the dev/graph branch 7 times, most recently from 41e28c9 to 54f3a05 Compare May 30, 2025 16:26

NeverRaR force-pushed the dev/graph branch 4 times, most recently from 5f20f3c to 53679a8 Compare May 30, 2025 17:58

feat: support compile torchair graph while warming up

53679a8

Signed-off-by: boying <897013703@qq.com>

wangxiyuan merged commit 507ae62 into vllm-project:main May 30, 2025
23 checks passed

wangxiyuan mentioned this pull request Jun 3, 2025

fix: ascend_scheduler adapt v0.9.0 #1018

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: support compile torchair graph while warming up #839

feat: support compile torchair graph while warming up #839

Uh oh!

NeverRaR commented May 13, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wangxiyuan May 29, 2025 •

edited

Loading

Uh oh!

Uh oh!

wangxiyuan commented May 30, 2025

Uh oh!

wangxiyuan commented May 30, 2025

Uh oh!

Yikun commented May 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

feat: support compile torchair graph while warming up #839

feat: support compile torchair graph while warming up #839

Uh oh!

Conversation

NeverRaR commented May 13, 2025

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wangxiyuan May 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

wangxiyuan commented May 30, 2025

Uh oh!

wangxiyuan commented May 30, 2025

Uh oh!

Yikun commented May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wangxiyuan May 29, 2025 •

edited

Loading

Yikun commented May 30, 2025 •

edited

Loading